ABRIR at NTCIR-9 GeoTime Task Usage of Wikipedia and GeoNames for Handling Named Entity Information

نویسنده

  • Masaharu Yoshioka
چکیده

 ABRIR at NTCIR-8 Approach C t ti f N d E tit d t b Construction of Boolean query by using pseudo relevant documents Named entity is more important than others Verb synonym list is used for increasing coverage Combination of Boolean IR model and probabilistic IR model (Okapi) Penalty is applied for documents that don’t satisfy Boolean query Named entity is more important than others  Failure analysis at NTCIR-8 ons ruc on o ame n y a a ase  Usage of Linked Open Data GeoNames : Large-scale database of geographic entities Wikipedia: Information resource for named entity Database construction  Geographic entities from GeoNames and Wikipedia • Add Japanese translation of GeoNames entry by using Wikipedia information [Takenaka, et. al., 2011] Other named entities from Wikipedia • Usage of DBPedia category information to identfy type of named entity (Person, Organization Place and Infrastructure) for making a list Identification of named entities and relationships between these entities and the query is important 「アフリカ(Africa)」⇔「コンゴ民主共和国(Democratic Republic of the Congo)」 Quality of pseudo relevant documents Topic drift Verb synonym expansion Some verbs are not important for Boolean query construction

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NTCIR-9 GeoTime at Osaka Kyoiku University - Toward Automatic Extraction of Place/Time Terms -

Our approach to NTCIR-9 Geotime was to obtain place/time information about topics from Wikipedia and Google using query terms extracted from topics. Adding this information to query terms, we retrieved documents using tag index and scored them. In addition, we compared tag of searched documents with time information, weighted the score value of documents retrieved, and ranked them...

متن کامل

NTCIR9-GeoTime Overview - Evaluating Geographic and Temporal Search: Round 2

ABSTRACT GeoTime for the NTCIR Workshop 9 is the second evaluation of Geographic and Temporal Information Retrieval called “NTCIR GeoTime”. The focus of this task is on search with Geographic and Temporal constraints. This overview describes the data collections (Japanese and English news stories), topic development, assessment results and lessons learned from this second NTCIR GeoTime task, wh...

متن کامل

University of Alicante at NTCIR-9 GeoTime

In this paper we present a complete system for the treatment of the geographical dimension in the text and its application to information retrieval. This system has been evaluated in the GeoTime task of the 9th NTCIR workshop, making it possible to compare the system with other current approaches to the topic. In order to participate in this task we have added to our GIR system the temporal dim...

متن کامل

NTCIR-8 GeoTime at Osaka Kyoiku University: Hierarchical Index for Geographic Retrieval

We retrieved topics that contained the geographic and temporal information at NTCIR-8 GeoTime task. Employing morphological analysis, temporal and geographic information are extracted from GeoTime collection. The index that represents a geographic hierarchy is made from the geographic information. In the experiment, we confirmed that the effect of the geographic hierarchical index when topics i...

متن کامل

SINAI at NTCIR-9 GeoTime: a filtering and reranking approach based solely on geographical entities

Geographic Information Retrieval (GIR) is an active and growing research area that focuses on the retrieval of textual documents according to a geographical criteria of relevance. In recent years, the IR research community has paid particular attention in IR systems that take into account temporal constraints. Temporal Information Retrieval (TIR) is a recent field which addresses the combinatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011